A conditional likelihood is required to estimate the selection coefficient in ancient DNA
Abstract
Time series of allele frequencies are a useful and unique source of data for determining the strength of natural selection against the background of genetic drift. Technically, the selection coefficient is estimated by means of a likelihood function built under the hypothesis that the available trajectory spans a sufficiently large portion of the fitness landscape. Especially for ancient DNA, however, often only a single such trajectory is available, and its coverage of the fitness landscape is very limited. In fact, a single trajectory is more representative of a process conditioned on both its initial and its final state than of a process free to visit the available fitness landscape. Based on two models of population genetics, we show here how to build a likelihood function for the selection coefficient that takes the statistical peculiarity of single trajectories into account. We show that this conditional likelihood delivers a precise estimate of the selection coefficient even when allele frequencies are close to fixation, whereas the unconditioned likelihood fails. Finally, we discuss the fact that the traditional, unconditioned likelihood always delivers an answer, which is often unfalsifiable and appears reasonable even when it is not correct.
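The contrast between the two likelihoods can be illustrated with a minimal sketch. It is not the construction used in the paper, whose two population-genetic models are not named in this abstract. The sketch assumes a haploid Wright-Fisher model with a known population size `n`, allele counts observed at every generation, and a conditioning that simply divides the path probability by the probability of reaching the observed final count from the initial one. All function names are hypothetical.

```python
from math import comb, log


def wf_transition_matrix(n, s):
    """Transition matrix of an assumed haploid Wright-Fisher model.

    Entry [i][j] is the probability of going from i to j allele copies in
    one generation, for population size n and selection coefficient s > -1.
    """
    rows = []
    for i in range(n + 1):
        x = i / n
        p = (1.0 + s) * x / (1.0 + s * x)  # allele frequency after selection
        rows.append([comb(n, j) * p**j * (1.0 - p) ** (n - j)
                     for j in range(n + 1)])
    return rows


def log_lik_unconditioned(counts, n, s):
    """Log-probability of the observed path given only its initial count."""
    P = wf_transition_matrix(n, s)
    return sum(log(P[a][b]) for a, b in zip(counts, counts[1:]))


def log_endpoint_prob(counts, n, s):
    """Log-probability of reaching the final count from the initial count
    in len(counts) - 1 generations, summed over all possible paths."""
    P = wf_transition_matrix(n, s)
    dist = [0.0] * (n + 1)
    dist[counts[0]] = 1.0
    for _ in range(len(counts) - 1):
        dist = [sum(dist[i] * P[i][j] for i in range(n + 1))
                for j in range(n + 1)]
    return log(dist[counts[-1]])


def log_lik_conditioned(counts, n, s):
    """Log-probability of the path given both its initial and final count
    (an assumed form of conditioning, chosen for illustration)."""
    return log_lik_unconditioned(counts, n, s) - log_endpoint_prob(counts, n, s)


def grid_estimate(log_lik, counts, n, grid):
    """Return the value of s on the grid that maximises log_lik."""
    return max(grid, key=lambda s: log_lik(counts, n, s))


if __name__ == "__main__":
    # Made-up trajectory of allele counts, for illustration only.
    trajectory = [10, 12, 15, 14, 18, 22, 25]
    grid = [k / 100 for k in range(-50, 101, 5)]
    print("unconditioned:", grid_estimate(log_lik_unconditioned, trajectory, 50, grid))
    print("conditioned:  ", grid_estimate(log_lik_conditioned, trajectory, 50, grid))
```

Under these assumptions the conditioned log-likelihood differs from the unconditioned one only by the endpoint term, which depends on `s` and can therefore shift the location of the maximum.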
Journal title:
Volume 6, issue
Pages -
Publication date: 2016